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Detecting the time evolution of the community structure of networks is crucial to identify major changes in the 
internal organization of many complex systems, which may undergo important endogenous or exogenous events. 
This analysis can be done in two ways; considering each snapshot as an independent community detection 
problem or taking into account the whole evolution of the network. In the first case, one can apply static methods 
on the temporal snapshots, which correspond to configurations of the system in short time windows, and match 
afterwards the communities across layers. Alternatively, one can develop dedicated dynamic procedures, so that 
multiple snapshots are simultaneously taken into account while detecting communities, which allows us to keep 
memory of the flow. To check how well a method of any kind could capture the evolution of communities, 
suitable benchmarks are needed. Here we propose a model for generating simple dynamic benchmark graphs, 
based on stochastic block models. In them, the time evolution consists of a periodic oscillation of the system’s 
structure between configurations with built-in community structure. We also propose the extension of quality 
comparison indices to the dynamic scenario. 

PACS numbers: 89.75.Fb, 89.75.Hc 


1. INTRODUCTION 

The analysis and modeling of temporal networks has re¬ 
ceived a great deal of attention lately, mainly due to the in¬ 
creasing availability of time-stamped network datasets M. 
A relevant issue is whether and how the community structure 
of networks |@] changes in time. Communities reveal how 
networks are organized and function, hence major changes in 
their configuration might signal important turns in the evolu¬ 
tion of the system as a whole, possibly anticipating dramatic 
developments such as rapid growth or disruption. 

Indeed, there has bee n a g reat deal of activity around this 
topic in recent years ll7l- [0 . However, most investigations 
lack strength on the validation part, which typically consists 
in checking whether the results of the algorithm “make sense” 
in one or more real networks whose community structure is 
usually unknown. Actually, it is not obvious what exactly it 
means to test an algorithm for detecting evolving communi¬ 
ties. One idea could be that of correctly identifying the com¬ 
munity structure of the system at each time stamp. However, 
during the evolution of the system several events that affect 
the network structure may occur, such as the creation or dele¬ 
tion of nodes or links or link rewiring, and it is not possible 
to detect these events by observing a single time-stamped net¬ 
work, they require taking into account the whole picture to be 
properly understood. 

To explicitly keep track of the history of the system, an op¬ 
tion is to consider multiple snapshots at once. For instance, 
in the evolutionary clustering approach the goal is to find 
a partition that is descriptive of the structure of a given snap¬ 
shot as well as correlated to the structure of the previous snap¬ 
shots. Furthermore, the added value of any approach should 
be the ability to promptly detect changes in the community 
structure of the network. It would be possible to verify this 
if there were suitable benchmark graphs with evolving clus¬ 
ters, but those are still missing. This paper aims at filling this 


gap. We propose a model, derived from the classic stochastic 
block models ifisl - l^ . that generates three classes of dynamic 
benchmark graphs. The objective is to provide time-evolving 
networks, such that at each snapshot the partition into commu¬ 
nities is well defined according to the model. To keep things 
simple we consider a periodic evolution such that the same 
history repeats itself in cycles and is invariant under time re¬ 
versal. The analysis of the community structure evolution for 
the designed benchmarks reveals that approaches exploiting 
the flow of system configurations might be more accurate in 
detecting the evolving community structure than methods that 
consider the snapshots independently. Note that in real data 
sets this evolution can be sharp and bursty, however in these 
cases the challenge of finding the community structure is not 
well defined, because the range of timescales makes the meso¬ 
scopic structure clearly disconnected. 

The paper is structured as follows. In Section 2 we de¬ 
scribe the model to generate the benchmark networks, Sec¬ 
tion 3 introduces measures of comparison between dynamic 
clusterings, Section 4 shows an example of the application of 
a dynamic multislice algorithm on the proposed benchmarks. 
Section 5 gives a summary and reports our conclusions. 


2. MODEL DESCRIPTION 

The model we propose for generating networks with evolv¬ 
ing community structure is based on the classic stochastic 
block model (SBM) ifl^ . It works as follows. A network 
is divided into a number q of subgraphs and the nodes of 
the same subgraph are linked with a probability pin, whereas 
nodes of different subgraphs are linked with a probability 
Pout- Such probabilities match the link densities within and 
between subgraphs. Supposing subgraphs of equal size, if 
Pin > (q — l)Pout the resulting subgraphs are communities, 
as the (expected) link density within subgraphs exceeds their 
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connectivity to the rest of the graph. The generation of sam¬ 
ples from this model has a built-in efficiency: If there are 
TOniax pairs of nodes, the actual number of edges is drawn 
from a binomial distribution with parameters rumax and p. 
Then, we simply place this number of edges randomly to gen¬ 
erate a sample from our ensemble. 

The model implements the two fundamental classes of dy¬ 
namic processes: growing or shrinking and merging or split¬ 
ting of communities. By combining these two reversible types 
of processes one can capture the most common behaviors 
of dynamic communities in real systems. We are then able 
to generate three standardized benchmarks: One consists in 
communities that grow and shrink in size (keeping fixed the 
total number of nodes of the network), while the second con¬ 
siders communities that merge and split. The third one is a 
mixed version of the previous two, and consists of a combina¬ 
tion of the last four operations. 


A. Grow-shrink benchmark 


This process models the movement of nodes from one com¬ 
munity to another. At all times, two communities are kept in 
a SBM ensemble with intracommunity link density pin and 
intercommunity link density Pout- However, the number of 
nodes in the two communities changes over time. In the basic 
process, we have a total of 2n nodes in two communities. In 
the balanced state, these are split into two equal communities 
of n nodes, which we call A and B. At the extremes, a frac¬ 
tion / of nodes in community A will switch to community B. 
If we take n\ as the size of community A, then the number of 
nodes in the community B is Ur = 2n — ni. Then, at time t 
the number of nodes in community A is 


UA = n — nf [2x{t + r/4) — 1] (1) 

with the t/4 phase factor specifying equal sized communities 
at f = 0. The function x{t) is the triangular waveform 


J 2t*, 0 <t* < 1/2 

\ 2-2t*, 1/2 < 1 


( 2 ) 


(with t* = {t/r + (j)) mod 1), which controls the time pe¬ 
riodicity. The constant is a phase factor with </ = 0 
for the q = 2 case and specified otherwise in the case of 
q > 2. With this formulation, we get communities of sizes 
(n,n), (n — nf,n + nf), (n,n), and (n -|- nf,n — nf) at 
ijr mod 1 = 0, j, and |, respectively. In practice, all 2n 
nodes are sorted in some arbitrary order, and the first ua nodes 
are put into community A, and the others into community B. 
Say these nodes are i = 0 to i = 2n — 1. 

After the community sizes are decided, the edges must be 
placed, taking into account that it is necessary that we keep 
the two communities in the proper SBM ensemble with equal 
and independent link probability pin at all times. The inde¬ 
pendence of pairs provides a hint on how to do this. When 
a node j is moved from community A to B, all the existing 
edges of node j are removed. Then an edge is added between 
j and each node in the destination community B with equal 
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FIG. 1: (Color online) Schematic representation of the benchmarks, 
(a) Grow-shrink benchmark with q — 2. We begin with two equal¬ 
sized communities, and over a period of r nodes move from the bot¬ 
tom community to the top, then from the top to bottom, then back 
to the symmetric state, (b) Merge-Split benchmark with q — 2. We 
begin with two communities, and over a period of r we linearly add 
edges until there is one community with uniform link density, then 
reverse the process, (c) Mixed benchmark with 5 = 4, combining 
the merging and growing processes. 


and independent probability pin and between j and each node 
in community A with equal and independent probability pout, 
thus the ensemble is maintained. Conveniently, all edges can 
be pre-computed and stored to allow a strictly repeating pro¬ 
cess, with the state at time t being identical to the state at time 
t + T,in analogy to the merging process. 

A special case that we need to cope with is the situation 
where / is very high and pin is very low. When this happens, 
a community shrinks too much and it may become discon¬ 
nected. In order to preserve the ensemble, we do not take 
actions to totally eliminate this possibility, but we ensure that 
n(l — /)pin 3> 2 to reduce the probability of disconnection. 
However, if a disconnection occurs, the process is aborted and 
re-run. Figure [TJ a) is a sketch of the grow-shrink benchmark 
for the case q = 2. 

B. Merge-split benchmark 

This process models the merging of two communities. In 
this setup, we have a set of 2n nodes, divided into two com¬ 
munities of n nodes each. Each of the two initial communities 
has a link density of pin, where those links are placed at initial¬ 
ization and kept unmodified over time. There are two extreme 
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states: the unmerged and the merged state. In the unmerged 
state, all possible pairs of nodes between the two communities 
have an edge with probability Pout- This means that the net¬ 
work still has a connected component, but the nodes form two 
communities. In the merged state, all possible pairs of nodes 
between these two communities have an edge with probability 
Pin, which implies that all pairs of nodes in the network have 
the same link density pi„, the previous two communities are 
now indistinguishable, and thus we have one large community 
with 2n nodes. 

The merge-split process is a periodic interpolation of the 
merged and unmerged states. The numbers of intercommunity 
edges in the unmerged state mum and in the merged state mm 
are first picked from a binomial distribution consistent with 
the binomial distribution parameters ti? and pout or pin. All 
possible intercommunity edges are placed in some arbitrary 
but random order, and the first 

m*{t) = [I - x{t)]rnnm + x{t)rn^ (3) 

edges are selected to be active at time t. The effective intra¬ 
community link density is = m*{t)/n'^. The param¬ 

eter x{t) is the triangular waveform from Eq. (|2l). In prac¬ 
tice, this means that at time tjr mod 1 = 0 the communi¬ 
ties are unmerged and mod 1 = 1/2 the communities 

are merged, with linear interpolation (of the number of edges) 
between these points. Since the possible edges are ordered 
only at initialization, the process is strictly periodic, that is, 
the edges present at time t are identical to those present at 
time t + T. 

One may think that the communities are fully merged at 
the extreme of this process, where the intercommunity link 
density is Pj*„ter = Pin t = x/2). However, due to the 
detectability limit of communities in stochastic block models, 
this is not the case 1^ . Even when pout < Pin, it can be that 
the configuration is indistinguishable from one large commu¬ 
nity. Eollowing El, at the point 


Pin - Planter = y “ + ^^ter) (4) 

we consider the communities to be merged into one for all 
practical purposes. While this limit is strictly speaking only 
accurate in the sparse and infinite-size limit, it is an adequate 
approximation. A schematic representation of the merge-split 
benchmark, for g = 2 is shown in Eig.lHb). 


C. Mixed benchmark 

This process is a combination of the merging and growing 
processes. In this process, there is a total of 4n nodes with two 
merging-splitting communities {2n nodes) and two growing- 
shrinking communities (2n nodes). The intra-community 
links are managed with the same processes as above with 
phase factors of ^ = 0 for both. If there are g = 4a > 4 
total communities, then the pairs of communities involved 
in merging and growing process have phase factors </> = 
0, T, Between the pairs of nodes that belong to 


different processes, an edge exists with a probability of Pout- 
Eigure[Tlc) exemplifies the mixed benchmark when g = 4. 


3. TIME-DEPENDENT COMPARISON MEASURES 


The assessment of the performance of any clustering algo¬ 
rithm requires the use of measures to define the distance or 
similarity between any pair of partitions. The list of avail¬ 
able measures is long, including e.g. the Jaccard index 12^ . 
the Rand index ll2^ . the adjusted Rand index 1^ . the nor¬ 
malized mutual information 12^ . the van Dongen metric 
and the normalized variation of information metric 1^ . All 
of them have in common the possibility of being expressed 
in terms of the elements of the so-called confusion matrix or 
contingency table, thus we focus first on its calculation. Let 
C = {C^\a = andC' = {C'^,\a' = 

be two partitions of the data in r and r' disjoint clusters. The 
aa'th component of the contingency table M accounts for the 
number of elements in the intersection of clusters Ca and C^,, 

= ianc;,|. (5) 


The sizes of the clusters simply read Ua = ic'd = tnriaa' 
and nd = ICdl = and the total number of ele¬ 
ments is N = K' = Ea Ea' With 

these definitions at hand, one can calculate the Jaccard index. 


J = 


EE 

Oi a' 


^aoi' 

2 


E 



the normalized mutual information index. 


r, 1 ^Xriaa' 

-2 2^ 2^ maa' log ■ 


UaU' 


NMI =-- 

5^rralog^ + ^<, log^ 

O' a' 

and the normalized variation of information metric. 


Nvi=^yy 

" TV 

a a' 


logN 


rriaa' , (Waa')^ 

log 


nati', 


( 6 ) 


(7) 


( 8 ) 


where, by convention, 0 log 0 = 0. 

In the case of evolving networks we have to compare two 
sequences of partitions {C{t)\t = 1,... ,T} and {C'{t)\t = 
1,..., T}, a task that can be performed in different ways. The 
simplest solution is the independent comparison of partitions 
at each time step, by measuring the similarity or distance be¬ 
tween C{t) and C'{t) for each value of t, thus obtaining ,e.g., 
a Jaccard index J{t) for each snapshot, see Eig. |2ja)- How¬ 
ever, this procedure discards the evolutionary nature of the 
communities: We would like to quantify not only the static 
resemblance of the communities but also if they evolve in a 
similar way. 
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FIG. 2: (Color online) Construction of the contingency tables niaa' ■ 
On top we represent three steps (columns) of the time evolution of a 
network of four nodes (rows), and the partitions in communities we 
want to compare, e.g. the planted partitions from the benchmark and 
those obtained by a certain algorithm. To compare these two parti¬ 
tionings, we can do it as it is depicted in (a), which takes only one 
snapshot at a time (a — 0), or as in (b), building a contingency table 
where the entries consider two snapshots at the same time (a = 1). 
Afterward, the measures (NVI, NMI or Jaccard index) are calculated 
from these tables. 


Our proposal consists in the definition of windowed forms 
of the different indices and metrics, obtained by considering 
sequences of consecutive partitions, i.e. time windows of a 
predefined duration cr. In Fig. ^h) we show the compari¬ 
son between individual snapshots and sequences of length 2. 
For example, let us consider the time window formed by time 
steps from ttot + a. Every node belongs to a different cluster 
at each snapshot, and this evolution can be identified as one 
of the items in 'D{t; a) = C{t) x C(t -F 1) x • • • x C{t + a) 
for the first sequence of partitions, and V'{t]a) = C'{t) x 
• • • X C'{t + a) for the second one, where the multiplication 
sign denotes the Cartesian product of sets. Since the num¬ 
ber of nodes is N, there are at most N different nonvoid sets 
Da{t',cr) € V(t;(T) and the same for cr) € 2?'(t;cr). 

For example, in Fig. |2lb), the combinations of partitions (ex¬ 
cluding empty sets) are 'D{t = 1; tr = 1) = {aa, AB, BB, CC} 
and V'{t = 1; tr = 1) = {aa, BB, CC}. Next, we may define 
the elements of the contingency table for this time window as 


maa'{t-,cr) = \Da{t-,cr) n 


(9) 


Ha (f; u) = \Da (f; O') 1 = ^ rriaa' (i 

Ct' 

oCr) , 

(10) 

n'a' (i; ct) = \D'^> (f; ct) 1 = ^ ( 

t;a), 

(11) 


and 


N = Y^ cr) = ^ <,(f; cr) = ^ ^ 


Too 


/(f;cr) 


( 12 ) 

Finally, we may use Eqs. (|6]l-(l8]l to calculate the correspond¬ 
ing windowed Jaccard index J(f;cr), windowed normalized 
mutual information index NMI{t; cr), and windowed normal¬ 
ized variation of information metric NVI{t; a), respectively. 
Of course, the windowed measures reduce to the standard 
static ones when cr = 0, and are able to capture differences 
in the evolution of communities that cannot be distinguished 
using their classical versions (see the Appendix). 

We will see in the next section how the plots of NVI{t; cr) 
are valuable to compare different algorithms and to detect in 
which moments of the time evolution they differ. Neverthe¬ 
less, it is also convenient to have a single number to quantify 
the overall deviation. A simple solution is the use of the aver¬ 
age squared errors, which is expressed as follows: 


Ei{(r) = ^^[J(f;cr) - 1]^ (13) 

1 ^ 

Enmi{<j) = (14) 

1 ^ 

ENViii) = -J2NVI{t;af. (15) 


which accounts for the number of nodes following the same 


T 

t=i 

Eor simplicity and for its superior mathematical properties 
(see lEsIl ) we have chosen to use only the NVI metrics in the 
rest of this article. See Supplemental Material for the results 
using the normalized mutual information and the Jaccard in¬ 
dex. 


4. RESULTS 

Here we show an example of the application of a com¬ 
munity detection algorithm, designed to take into account 
the evolution of complex networks, to reveal the community 
structure in our benchmarks. The chosen method is the multi¬ 
slice algorithm in ifl^ . which extends the definition of mod¬ 
ularity to multilayer networks. In their representation, each 
layer (slice) consists of a single network at a particular time. 
The slices are connected between them by joining each node 
with its counterpart in the next and previous layer, and this 
link has a specified weight w, equal for all links of this kind, 
which acts as a tuning parameter. Eor a; = 0, no connection 
between slices is considered and the algorithm is performed 
statically. As this value increases, more consideration is given 
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to the communities across layers. The formulation includes 
an additional parameter 7, which accounts for the tuning of 
the resolution at which communities are found, in the manner 
of lUit]. In this work, we have used the code available in 1^ . 
setting the resolution parameter 7 to 1 and varying the inter¬ 
slice coupling u!. 

The benchmarks used to put to test this algorithm are gen¬ 
erated using the model proposed in this paper. For the sake 
of simplicity, we generate three simple standard benchmarks, 
one for each basic procedure: grow-shrink, merge-split and 
mixed. The grow-shrink benchmark consists in a network 
with q = 2 communities, where each community has ini¬ 
tially n = 32 nodes (therefore the total size of the network is 
N = 64), with Pin = 0.5, Pout = 0.05, / = 0.5, and r = 100 
time steps. The merge-split test has a variable number of com¬ 
munities; in this paper we use the parameters q = 2 commu¬ 
nities of size n = 32 each, with pin = 0.5, Pout = 0.05, and 
T = 100. The mixed benchmark, a combination of the previ¬ 
ous two, has q = A communities of n = 32 nodes each, and 
the other parameters are set as in the previous cases. 

Figure [3] shows the planted partitions for the three bench¬ 
marks and the results from the multislice algorithm at three 
different interslice couplings: In the extreme case a; = 0 
slices are considered independently, uj = 0.5 is an interme¬ 
diate value that provides good results, and uj = 2 provides 
an example of the partitioning obtained when using strong 
coupling between layers. It can be seen that for a; = 0 we 
obtain a different partition for each time step, and the results 
are mostly correct, except for those configurations of the sizes 
of the communities where the preference of modularity for 
equal-sized communities hampers the process (see the first 
column of Fig. [^i. Higher values of w request higher consis¬ 
tency through time, which implies that the number of misclas- 
sified individual snapshots is reduced. We have also compared 
the multislice method with a temporal stability approach 
and the results obtained are very similar to the results of the 
multislice algorithm obtained at w = 0.5. 

To quantitatively evaluate the results, we use the windowed 
measures introduced in the previous section. We calculate the 
measures between the partitions obtained by the algorithm and 
the planted ones, for three values of the time window. When 
the time window is of size 1 (cr = 0), each snapshot is con¬ 
sidered independently, that is, we have computed the measure 
between the planted partition at t and the algorithm’s result 
at t, repeating this process until t = t. Instead, with the time 
window of size 2 (cr = 1), we evaluate the evolution of the par¬ 
titions during two consecutive time steps, following the same 
process but comparing the planted partitions at [t, f -f 1] with 
the algorithm’s results at [t, f -f 1]. This formulation is more 
restrictive, as we impose, in addition to the condition that the 
nodes must belong to the same community, that their evolu¬ 
tion during two consecutive time steps is also the same. Sim¬ 
ilarly, we have also analyzed time windows of size 5 (cr = 4) 
to check the quality of the detected community evolutions at 
longer ranges. 

Figure|4]shows the results for the NVI. We observe that, for 
the grow-shrink benchmark, the error is large for ui = 0, but 
becomes almost zero at a; = 2. Moreover, the values of the 


NVI increase with the size of the time window for a; = 0 and 
u! = 0.5, but in a larger amount when the parameter corre¬ 
sponds to the static version of the multislice algorithm. This 
means that the interslice weight is helping to find the persis¬ 
tence of nodes in their communities, as expected. The merge- 
split benchmark shows an almost identical bad performance 
for the three values of uj at windows of size 1, but uj = 2 does 
not make it worse when the size of the window increases, un¬ 
like the other two. The mixed benchmark is quite neutral, with 
just a small difference from uj = 2. Finally, the NVI squared 
errors reported in Table U and calculated using Eq. ( fTSb are 
in perfect agreement with this analysis. The results using the 
NMI and Jaccard indices (see the Supplemental material) also 
support these observations. Thus, we may conclude that, in 
this case, the use of memory to track the evolution of commu¬ 
nities is convenient, but the trade-off between the continuity 
of the community structure and its static relevance must be 
carefully adjusted. 

5. CONCLUSIONS 

We have presented a simple model based on the stochas¬ 
tic block model that allows for the construction of time- 
dependent networks with evolving community structure. It 
is useful for benchmarking purposes in testing the ability of 
community detection algorithms to track properly the struc¬ 
tural evolution. We have also introduced extended time- 
dependent measures for the comparison of different partitions 
in the dynamic case, which allow for the observation of differ¬ 
ences between the outcome of the algorithms and the planted 
partitions through time. 

Our code for benchmark generation and the time-dependent 
comparison indices is available at and released under the 
GNU General Public License. 
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Appendix 

Distinguishing community evolutions with windowed measures 

Eigure |5] shows an example in which, according to the 
planted partitions, the eight nodes of a network are divided in 
two communities of four nodes each and these partitions re¬ 
main constant throughout the three times steps of the network 
evolution. Two different community detection algorithms find 
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FIG. 3: (Color online) Results of the application of the multislice community detection method to the three benchmarks proposed (in columns). 
The first row corresponds to the planted partition of each benchmark, while the three remaining rows are the partitions obtained by the multislice 
algorithm for different values of the interslice parameter a;, which is the weight of the coupling between different instances of the same nodes 
across layers. When a; = 0 the slices are disconnected and then the community detection analysis is done for each slice separately. As this 
value increases, more importance is given to the evolving nature of the problem, and communities across slices are found. In each plot, the 
vertical axis corresponds to the index of nodes in the network, while the horizontal axis represents the time. The color of each pair {node, 
time} is the label of the community at which the node is assigned at that specific time. 


the communities evolutions represented in Figs. 13 a) and|5lb), 
which are characterized by the assignment of just one node 
to the wrong community at each time step. In Fig. |3a) this 
node is the fourth one during the three time steps, while in 
Fig-Bb) they are the second, the third, and the sixth, respec¬ 
tively. Since the nature of the mistake is the same at all time 
steps, the comparison of the planted and algorithm partitions 
with a time window of size 1 generates equivalent contingency 
tables, thus the standard comparison measures do not change 


in time, with a constant value of the NVI equal to 0.2856. 
However, if we take into account a time window of size 3, the 
two evolving community structures detected by the algorithms 
are different, yielding structurally different contingency tables 
and values of the NVI equal to 0.2856 and 0.3852, respec¬ 
tively. Therefore, the conclusion is that windowed measures 
give complementary information for the comparison of time 
evolving community structures due to their capacity to take 
into account several snapshots at the same time. 
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FIG. 4: (Color online) Plots of the normalized variation of information (NVI) between the planted partition and the results of the multislice 
algorithm in Fig. [3 for three different interslice couplings and for the three benchmarks proposed. The NVI is computed using the proposed 
evolving formulation and for three different window sizes; 1, 2 and 5. There is a column for each benchmark, and a row for each time window 
size. 
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NVI squared error 


Grow-shrink 

Merge-split 

Mixed 
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0.0851 
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p 

O 

II 

3 
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0.2146 
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5 
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0.1922 
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TABLE I: The NVI squared error, for each method tested and each benchmark in Fig.[^ considering three different time windows. 
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FIG. 5: (Color online) Example of the comparison of a planted evolving community structure with the results from two different algorithms. 
The NVI values are shown below the corresponding contingency tables for time windows of sizes 1 and 3. 
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Time 

Jaccard squared error 


Multislice 

window 

Grow-shrink 

Merge-split 

Mixed 


1 

0.0720 

0.3345 

0.0307 

o 

o 

2 

0.1365 

0.4840 

0.0303 


5 

0.2336 

0.6272 

0.0325 


1 

0.0293 

0.3193 

0.0276 

OJ = 0.5 

2 

0.0546 

0.4608 

0.0282 


5 

0.1105 

0.6013 

0.0303 


1 

0.0019 

0.3326 

0.0360 

II 

to 

o 

2 

0.0014 

0.3605 

0.0374 


5 

0.0147 

0.4488 

0.0421 


TABLE II: Jaccard squared error, for each method tested and each benchmark, considering three different time windows. 



Time 

NMI squared error 


Multislice 

window 

Grow-shrink 

Merge-split 

Mixed 
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0.0337 

0.4932 

0.0067 
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0.0621 

0.4806 

0.0063 
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0.1022 

0.4855 

0.0059 


1 

0.0143 

0.4896 

0.0060 

w = 0.5 

2 

0.0262 

0.4753 

0.0059 


5 

0.0479 

0.4790 

0.0055 


1 

0.0065 

0.4951 

0.0094 

UJ = 2.0 

2 

0.0041 

0.4891 

0.0094 


5 

0.0041 

0.4825 

0.0100 


TABLE III: NMI squared error, for each method tested and each benchmark, considering three different time windows. 
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FIG. 6; Plots of the Jaccard Index between the planted partition and the results of the multislice algorithm for three different interslice couplings 
and for the three benchmarks proposed. The Jaccard index is computed using the proposed evolving formulation and for three different window 
sizes: 1, 2 and 5. There is a column for each benchmark, and a row for each time window size. 
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FIG. 7: Plots of the Normalized Mutual Information (NMI) between the planted partition and the results of the multislice algorithm for three 
different interslice couplings and for the three benchmarks proposed. The NMI is computed using the proposed evolving formulation and for 
three different window sizes: 1, 2 and 5. There is a column for each benchmark, and a row for each time window size. 











































